PageRank model of opinion formation on social networks 
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We propose the PageRank model of opinion formation and investigate its rich properties on real 
directed networks of Universities of Cambridge and Oxford, LiveJournal and Twitter. In this model 
the opinion formation of linked electors is weighted with their PageRank probability. We find that 
the society elite, corresponding to the top PageRank nodes, can impose its opinion to a significant 
fraction of the society. Ifowever, for a homogeneous distribution of two opinions there exists a 
bistability range of opinions which depends on a conformist parameter characterizing the opinion 
formation. We find that LiveJournal and Twitter networks have a stronger tendency to a totalitar 
opinion formation. We also analyze the Sznajd model generalized for scale-free networks with the 
weighted PageRank vote of electors. 

PACS numbers: 05.10.-a,89.20.-a,89.75.-k 



I. INTRODUCTION 



To understand the nature and origins of mass opinion 
formation is the outstanding challenge of democratic so- 
cieties [ij. In a last few years an enormous development 
of such social networks as LiveJournal , Facebook ^2,] , 
Twitter [i] and VKONTAKTE with up to hundreds 
of millions of users, demonstrated the growing influence 
of these networks on social and political life. The small- 
world scale-free structure of the social networks (see e.g. 
00])' combined with their rapid communication facil- 
ities, leads to a very fast information propagation over 
networks of electors, consumers, citizens making them 
very active on instantaneous social events. This puts for- 
ward a request for new theoretical models which would 
allow to understand the opinion formation process in the 
modern society of XXI century. 

The important steps in the analysis of opinion for- 
mation have been done with the development of var- 
ious voter models described in a great detail in Q, 
[il,[iiLM. This research field became 
known as sociophysics ] In this work we in- 



troduce several new aspects which take into account the 
generic features of the social networks. At first, we an- 
alyze the opinion formation on real directed networks 
taken from the Academic Web Link Database of British 
universities networks 16| , LiveJournal database and 
Twitter dataset [l^]- This allows us to incorporate the 
correct scale-free network structure instead of unrealistic 
regular lattice networks, often considered in voter mod- 
els [l3, 14 1 . At second, we assume that the opinion at a 



given node is formed by the opinions of its linked neigh- 
bors weighted with the PageRank probability of these 
network nodes. We think that this step represents the 
reality of social networks: all of network nodes are char- 
acterized by the PageRank vector which gives a probabil- 
ity to find a random surfer on a given node as described 
in [l3, HO]- This vector gives a steady-state probabil- 
ity distribution on the network which provides a natural 



ranking of node importance, or elector or society member 
importance. In a certain sense the top nodes of PageR- 
ank correspond to a political elite of the social network 
which opinion influences the opinions of other members 
of the society |l| . Thus the proposed PageRank Opinion 
Formation (PROF) model takes into account the situ- 
ation in which an opinion of an influential friend from 
high ranks of the society counts more than an opinion 
of a friend from lower society level. We argue that the 
PageRank probability is the most natural form of ranking 
of society members. Indeed, the efficiency of PageRank 
rating is demonstrated for various types of scale-free net- 
works including the World Wide Web (WWW) [ll,!!^, 



Physical Review citation network [21 
nal rati ng 

articles 12511 . the world trade network 



scientific jour- 
ranking of tennis players |2J], Wikipedia 

2611 and others. 



Due to the above argument we consider that the PROF 
model captures the reality of social networks and below 
we present the analysis of its interesting properties. 

The paper is composed as follows: the PROF model 
is described in Sec. II, the numerical results on its prop- 
erties are presented in Sec. Ill for British University net- 
works. In Sec. IV we combine the PROF model with 



the Sznajd model [13|,|27[ and study the properties of the 
PROF-Sznajd model. In Sec.V we analyze the models 
discussed in previous Sections on an example of large so- 
cial network of the LiveJournal [l^ • The results for the 
Twitter dataset are presented in Sec. VI. The discus- 
sion of the results is presented in Sec. VII. 



II. PAGERANK OPINION FORMATION (PROF) 
MODEL DESCRIPTION 



The PROF model is defined in the following way. In 
agreement with the standard PageRank algorithm [2^ 
we determine the PageRank probability Pt for each node 
i and arrange all N nodes in a monotonic decreasing or- 
der of the probability. In this way each node i has a 
probability P{Ki) and the PageRank index Ki with the 
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maximal probability is at Ki = 1 (X^i^i = !)• We 

use the usual damping factor value a = 0.85 to compute 
the PageRank vector of the Google matrix of the network 
(see e.g. [ll,!!^],!!!!!!)- In addition to that a network 
node i is characterized by an Ising spin variable Ui which 
can take values +1 or —1 coded also by red or blue color 
respectively. The sign of a node i is determined by its di- 
rect neighbors j which have the PageRank probabilities 
Pj. For that we compute the sum over all directly 
linked neighbors j of node i: 



j,out 



(1) 



j.out ' 



a + b 



where Pj^in and Pj.out denote the PageRank probability 
Pj of a node j pointing to node i (incoming link) and a 
node j to which node i points to (outgoing link) respec- 
tively. Here, the two parameters a and b are used to tune 
the importance of incoming and outgoing links with the 
imposed relation a + b = 1 (0 < a, 6 < 1). The values P+ 
and P~ correspond to red and blue nodes respectively. 
The value of spin (T^ takes the value 1 or —1 respectively 
for Si > or < 0. In a certain sense we can say that 
a large value of parameter b corresponds to a conformist 
society where an elector i takes an opinion of other elec- 
tors to which he points to (nodes with many incoming 
links are in average at the top positions of PageRank). 
On the opposite side a large value of a corresponds to 
a tenacious society where an elector i takes mainly an 
opinion of those electors who point to him. 



interacting spins: 

= JyCTiCTj = -^SiCTj , (2) 



where the spin-spin interaction Jij determines the local 
magnetic field Bi on a given node i 



Bi = V(ai'j,m + bPj^out)(Jj 



j 



(3) 



which gives the local spin energy e, = —BiUi. Accord- 
ing to ([2]), ([S]) the interaction between a selected spin i 
and its neighbors j is given by the PageRank probability: 



aPn 



bP. 



j^out ■ 



Thus from a physical view point 



the whole system can be viewed as a disordered ferromag- 
net 14 1 . In this way the condition (jlJ corresponds to 
a local energy e; minimization done at zero temperature. 
We note that such an analogy with spin systems is well 
known for opinion formation models on regular lattices 
[i3]>[i31j0- However, it should be noted that generally 
we have asymmetric couplings Jij ^ Jji that is unusual 
for physical problems (see discussion in (30|). In view of 
this analogy it is possible to introduce a finite tempera- 
ture T and then to make a probabilistic Metropolis type 
condition 31| for the spin i inversion determined by a 
thermal probability pi = exp(— Ae^/T), where Ae; is the 
energy difference between on-site energies with spin 
up and down. During the relaxation process each spin is 
tested on inversion condition that requires N steps and 
then we do t iterations of such N steps. We discuss the 
results of the relaxation process at zero and finite tem- 
peratures T in next Section. 



III. NUMERICAL RESULTS FOR PROF MODEL 
ON UNIVERSITY NETWORKS 



FIG. 1: (Color online) Time evolution of opinion given by a 
fraction of red nodes f{t) as a function of number of iterations 
t. The red and black curves (top and bottom curves at f = 15 
respectively) show evolution for two different realizations of 
random distribution of color with the same initial fraction 
fi — 0.5 at t = 0. The green curve (middle curve at t — 15) 
shows dependence f{t) for the initial state with Ntop all red 
nodes with top PageRank K indexes (highest P{Ki) values, 
1 < K < Ntop)- The evolution is done at a = & = 0.5 and 
temperature T = 0. Left panel : Cambridge network with 
Ntop = 2000. Right panel : Oxford network with Atop = 1000. 



The condition ([1} on spin inversion can be written via 
the effective Ising Hamiltonian H of the whole system of 



Here we present results for PROF model considered on 
the networks of Cambridge and Oxford Universities in 
year 2006, taken from [l^. The properties of PageRank 
distribution P{K) for these networks have been analyzed 
in 2^ 2^ . The total number of nodes N and links Nf^ are: 



N = 212710, Nt = 2015265 (Cambridge); N = 200823, 
Ni = 1831542 (Oxford) Both networks are char- 

acterized by an algebraic decay of PageRank probability 
P{K) oc and approximately usual exponent value 

/3 « 0.9, additional results on the scale-free properties 
of these networks are given in (2^, [2^ . We discuss usu- 
ally the fraction of red nodes since by definition all other 
nodes are blue. 

The typical examples of time evolution of fraction of 
red nodes f{t) with the number of time iterations t are 
shown in Fig. [TJ We see the presence of bistability in 
the opinion formation: two random states with the same 
initial fraction of red nodes fi = fit = Q) evolve to two 
different final fractions of red nodes //. The process gives 
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ank probability P{K) and that the correlations between 
CheiRank and PageRank are not very strong (see discus- 
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FIG. 2: (Color online) Time evolution of opinion colors 
(red/gray and blue/black) for the parameters of Fig. [1] 
left/right column is for Cambridge/Oxford network. The ini- 
tial fraction of red colors is fi — 0.5 (top panel) and Ntop 
nodes have red color for bottom panels with Ntop = 2000 and 
1000 for Cambridge and Oxford network respectively. Nodes 
are ordered by the PageRank index K and color plot shows 
only K < 20000. 



an impression of convergence to a fixed state approxi- 
mately after tc ~ 10 iterations. A special check shows 
that all node colors become fixed after this time tc- The 
convergence time to a fixed state is similar to those found 
for opinion formation on regular lattices where tc = 0(1) 
Ulii,!!!. The corresponding time evolution of colors 
is shown in Fig. [2] for first 10% of nodes ordered by the 
PageRank index K. 

The results of Fig. [1] show that for a random initial 
distribution of colors we may have different final states 
with ±0.2 variation compared to the initial fi — 0.5. 



However, if we consider that Ntop nodes with the top K 
index values (from 1 to Ntop) have the same opinion (e.g. 
red nodes) then we find that even a small fraction of the 
total number of nodes N (e.g. Ntop of about 0.5% or 1% 
of N) can impose its opinion for a significant fraction of 
nodes of about // w 0.4. This shows that in the frame 
of PROF model the society elite, corresponding to top 
K nodes, can significantly infiuence the opinion of the 
whole society under the condition that the elite members 
have the fixed opinion between themselves. 

We also considered the case when the red nodes are 
placed on Ntop = 2000 top nodes of CheiRank index K*. 
This ranking is characterized by the CheiRank probabil- 
ity P*{K*) for a random surfer movin g in the inverted 
direction of links as described in [2^, 129| . In average 
P*{K*) is proportional to the number of outgoing links. 
However, in this case the top nodes with a small fi val- 
ues are not able to impose their opinion and the final 
fraction becomes blue. We attribute this to the fact that 
the opinion condition ([T]) is determined by the PageR- 



FIG. 3: (Color online) Density plot of probability Wf to find 
a final red fraction //, shown in y— axis, in dependence on 
an initial red fraction fi, shown in x— axis; data are shown 
inside the unit square < fi, ff < 1- The values of Wf are 
defined as a relative number of realizations found inside each 
of 20 X 20 cells which cover the whole unit square. Here Nr = 
10^ realizations of randomly distributed colors are used to 
obtained Wf values; for each realization the time evolution is 
followed up the convergence time with up to f = 20 iterations; 
here T = 0. Left column: Cambridge network; right column: 
Oxford network; here a = 0.1,0.5,0.9 from top to bottom. 
The probability Wf is proportional to color changing from 
zero (blue/black) to unity (red/gray). 

To analyze how the final fraction of red nodes // de- 
pends on its initial fraction fi we study the time evolution 
f{t) for a large number N^ of initial random realizations 
of colors following it up to the convergent time for each 
realization. We find that the final red nodes are homoge- 
neously distributed in K. Thus there is no specific pref- 
erence for top society levels for an initial random distri- 
bution. The probability distribution Wf of final fractions 
ff is shown in Fig. [3] as a function of initial fraction fi at 
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three values of parameter a. These results show two main 
features of the model: a small fraction of red opinion is 
completely suppressed if fi < fc and its larger fraction 
dominates completely for > 1 — f^, there is a bistabil- 
ity phase for the initial opinion range fb ^ fi ^ ^ — fb- 
Of course, there is a symmetry in respect to exchange 
of red and blue colors. For small value a = 0.1 we have 
fb ~ fc with fc « 0.25 while for large value a = 0.9 we 
have fc « 35, fb « 0.45. 
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FIG. 4: (Color online) Dependence of the final fraction of red 
nodes // on the tenacious parameter a (or conformist param- 



eter b - 



for initial red nodes in Nto 



2000 values of 



PageRank index (1 < A" < Ntop)'-, black and red(gray) curves 
show data for Cambridge and Oxford networks; here T — 0. 

Our interpretation of these results is the following. For 
small values of a — > the opinion of a given society mem- 
ber is determined mainly by the PageRank of neighbors 
to whom he points to (outgoing links). The PageRank 
probability P of nodes, on which many nodes point to, is 
usually high since P is proportional to the number of in- 
going links [i^]- Thus, at a — >■ a society is composed of 
members who form their opinion listening an elite opin- 
ion. In such a society its elite with one color opinion 
can impose this opinion to a large fraction of the society. 
This is illustrated on Fig. Uwhich shows a dependence of 
final fraction of red // nodes on parameter a for a small 
initial fraction of red nodes in the top values of PageRank 
index {Ntop — 2000). We see that a = corresponds to 
a conformist society which follows in its great majority 
the opinion of its elite. For a = 1 this fraction ff drops 
significantly showing that this corresponds to a regime 
of tenacious society. It is somewhat surprising that the 
tenacious society (a —J' 1) has well defined and relatively 
large fixed opinion phase with a relatively small region of 
bistability phase. This is in a contrast to the conformist 
society at a — )■ when the opinion is strongly influenced 
by the society elite. We attribute this to the fact that 
in Fig. [3] we start with a randomly distributed opinion, 
due to that the opinion of elite has two fractions of two 




FIG. 5: (Color online) Same as in Fig. [3] (middle panels) 
at a = 0.5 but with uniform condition for spin flip being 
independent of PageRank probability (top panels: P — 1 
in Eq. U])) and PageRank probability P replaced by \/P in 
Eq.([T]) (bottom panels); left and right panels correspond to 
Cambridge and Oxford networks; here T — and A'^^ ~ 
realizations are used. 



colors that create a bistable situation since two fractions 
of society follows opinion of this divided elite that makes 
the situation bistable on a larger interval of fi compared 
to the case of tenacious society at a — > 1. 

To stress the important role of PageRank in the depen- 
dence of // on fi presented in Fig. [3] we show in Fig. [S] 
the same analysis at a = 0.5 but for the case when in 
Eq.([T]) for the spin flip we take all P = 1 (equal weight 
for all nodes) . The data of Fig. [5] clearly demonstrate 
that in this case the bistability of opinion disappears. 
Thus the PROF model is qualitatively different from the 
case when only the links without their PageRank weight 
are counted for the spin flip condition. We also test the 
sensitivity in respect to PageRank probability replacing 
P by ^/P in Eq.([T]) as it is shown in Fig. [5] (bottom pan- 
els) . We see that compared to the case P = I we start to 
have some signs of bistability but still they remain rather 
weak compared to the case of Fig. |31 

In fact the spin flip condition ([T]) can be viewed as a 
relaxation process in a disordered ferromagnet (since all 
Jij > in ^) at zero temperature. Such type of 
analysis of voter model relaxation process on regular lat- 
tices is analyzed in f3|, |fj| . From this view point it is nat- 
ural to consider the effect of finite temperature T on this 
relaxation. At finite T the flip condition is determined 
by the thermal Metropolis probability exp(— Ae^/T) as 
described in previous Section. We follow this thermo- 
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However, the relaxation process at finite temperatures 
does not lead to the thermal Boltzmann distribution. In- 
deed, in Fig.[7]we show the probability distribution 'Wi{ei) 
as a function of local energies defined in ©, ([3]). The 
distribution Wi{ei) is obtained from the relaxation pro- 
cess with many initial random spin realizations Nr . Even 
if the temperature T is comparable with typical values of 
local energies ei we still obtain a rather peaked distribu- 
tion at ei « being very different from the Boltzmann 
distribution. 




FIG. 6: (Color online) Same as in Fig. |3] (middle panel) at a = 
0.5 but at finite temperature T during the relaxation process 
with T = 0.001 (top panels) and T = 0.01 (bottom panels); 
the number of random initial realizations is Nr = 6000, the 
relaxation is done during t = 200 iterations. Left and right 
columns correspond to Cambridge and Oxford networks. 



dynamic relaxation process at finite temperature up to 
t — 200 iterations and in this way obtain the probabil- 
ity distribution of final // fraction of red nodes obtained 
from initial fi fraction of red nodes randomly distributed 
over the network at t = 0. The results obtained at finite 
temperatures are shown at Fig. |6l They show that a fi- 
nite temperature T allows to have a finite fraction of red 
nodes ff when for their small initial fraction fi all final 
// were equal to zero. Also the bistability splitting is re- 
duced and it disappears at larger values of T. Thus finite 
T introduce a certain smoothing in Wf distribution. 
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FIG. 7: (color online) Normalized histograms of probability 
distribution Wi over local energies ei obtained from the relax- 
ation process during t — 10^ time iterations at temperatures 
T — 0.01 (black curve) and T — 0.05 (red/gray curve); aver- 
age is done over A*',. = 200 random initial realizations. The 
insets show the distributions on a large scale including all lo- 
cal energies e^. Left and right panels show Cambridge and 
Oxford networks. 



FIG. 8: (Color online) Diagram shows the final fraction of 
red nodes ff, coded by color from ff — (black) to = 1 
(blue/dark gray), as a function of initial fraction of red nodes 
fi and the total initial energy Ei; each of A'^,- > 3.5 x 10* 
random realizations is shown by color point; data are shown 
after t — 20 time iterations at T = 0. The energy £"0 is the 
modulus of total energy with all spin up; here a = 0.5. Left 
and right panels show data for Cambridge {Eo — 341.20) and 
Oxford (Eq = 254.28) networks; bars show color attribution 
to final probability //. 



We argue that a physical reason of significantly non- 
Boltzmann distribution is related to the local nature of 
spin flip condition which does not allow to produce a good 
thermalization on a scale of the whole system. Indeed, 
there are various energetic branches and probably non- 
local thermalization flips of group of spins are required 
for a better thermalization. However, the voting is a lo- 
cal process that involves only direct neighbors that seems 
to be not sufficient for emergence of the global thermal 
distribution. The presence of a few energy branches is 
well visible from the data of Fig. |5] obtained at T = 0. 
This figure shows the diagram of final fraction // of red 
nodes in dependence on their initial fraction fi and the 
total initial energy Ei = X]m=i '^^ whole system 
corresponding to a chosen initial random configuration 
of spins. Most probably, these different branches prevent 
efficient thermalization of the system with only local spin 
flip procedure. In addition to the above points the asym- 



metric form of Jij couplings plays an important role gen- 
erating more complicated picture compared to the usual 
image of thermal relaxation (see e.g. [30|). We also note 
that the thermalization is absent in voter models on reg- 
ular lattices lisll. 
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IV. PROF-SZNAJD MODEL 

The Sznajd model [2^ nicely incorporates the well- 
known trade union principle "United we stand, divided 
we fall" into the field of voter modeling and opinion for- 
mation on regular networks. The review of various as- 
pects of this model is given in 13[. Here we generalize 



the Sznajd model to include in it the features of PROF 
model and consider it on social networks with their scale- 
free structure. This gives us the PROF-Sznajd model 
which is constructed in the following way. For a given 
network we determine the PageRank probability P{Ki) 
and PageRank index Ki for all i nodes. After that we 
introduce the definition of group of nodes. It is defined 
by the following rule applied at each time step r: 

• i) pick by random a node i in the network and con- 
sider the polarization of the Ng — 1 highest PageR- 
ank nodes pointing to it; 

• ii) \i the node i and all other Ng—1 nodes have 
the same color (same spin polarization) , then these 
Ng nodes form a group whose effective PageRank 
value is the sum of all the member values Pg = 

S jii Pj ! if it is ^ot the case then we leave the 
nodes unchanged and perform the next time step; 

• in) consider all the nodes pointing to any member of 
the group (this corresponds to the model option 1 ) 
or consider all the nodes pointing to any member of 
the group and all the nodes pointed by any member 
of the group (this corresponds to the model option 
2); then check all these nodes n directly linked to 
the group: if an individual node PageRank value P„ 
is less than Pgroup then this node joins the group by 
taking the same color (polarization) as the group 
nodes; if it is not the case then a node is left un- 
changed; the PageRank values of added nodes are 
then added to the group PageRank Pgroup and the 
group size is increased. 

The above time step is repeated many times during time 
r, counting the number of steps, by choosing a random 
node i on each next step. This procedure effectively 
corresponds to the zero temperature case in the PROF 
model. 

A typical example of the time evolution of the fraction 
of red nodes /(r) in the PROF-Sznajd model is shown in 
Fig. [9l It shows that the system converges to a steady- 
state after a time scale Tc ss 107V that is comparable 
with the convergence times for the PROF models stud- 
ied in previous Sections. We see that there are still some 
fluctuations in the steady-state regime which are visibly 
smaller for the option 2 case. We attribute this to a 
larger number of direct links in this case. The number 
of group nodes Ng gives some variation of // but these 
variations remain on a relatively small scale of a few per- 
cents. Here, we should point on the important difference 
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FIG. 9: (Color online) Time evolution of the fraction of red 
nodes /(r) in the PROF-Sznajd model with the initial frac- 
tion of red nodes fi = 0.7 at one random realization. The 
curves show data for three values of group size Ng = 3 
(blue/black); 8 (green/light gray); 13 (red/gray). Full/dashed 
curves are for Cambridge/Oxford networks; left panel is for 
option 1; right panel is for option 2. 



between PROF and PROF-Sznajd models: for a given 
initial color realization, in the first case we have conver- 
gence to a fixed state after some convergence time while 
in the second case we have convergence to a steady-state 
which continue to fluctuate in time, keeping the colors 
distribution only on average. 

The dependence of the flnal fraction of red nodes // on 
its initial value fi is shown by the density plot of prob- 
ability Wf in Fig. Uni (option 1 of PROF-Sznajd model). 
The probability Wf is obtained from many initial ran- 
dom realizations in a similar way to the case of Fig. [S] 
We see that there is a significant difference compared to 
the PROF model (Fig. |3]): now even at small values of fi 
we find small but finite values of // while in the PROF 
model the red color disappears at fi < fc- This fea- 
ture is related to the essence of the Sznajd model: here 
even small groups can resist against totalitar opinion. 
Other features of Fig. [10] are similar to those found for 
the PROF model: we again observe bistability of opin- 
ion formation. The number of nodes Ng , which form the 
group, does not affect significantly the distribution Wf, 
we have smaller fiuctuations at larger Ng values but the 
model works in a stable way already at Ng = 3. The re- 
sults for the option 2 of PROF-Sznajd model are shown 
in Fig. 111! In this case the opinions with a small initial 
fraction of red nodes fi are suppressed in a significantly 
stronger way compared to the option 1. We attribute this 
to the fact that large groups can suppress in a stronger 
way small groups since the outgoing direct links are taken 
into account in this option. 

The significant difference between the two options of 
PROF-Sznajd model is well seen from the data of Fig. [12] 
Here, all Ntop nodes are taken in red (compare with the 
PROF model in Fig. [4]) . For the option 1 the society elite 
succeeds to impose its opinion to a significant fraction 
of nodes which is increased by a factor 5-10. Visibly, 
this increase is less significant than in the PROF model. 
However, for the option 2 of PROF-Sznajd model there is 
practically no increase of the fraction of red nodes. Thus, 




FIG. 10: (Color online) PROF-Sznajd model, option 1: den- 
sity plot of probability Wf to find a final red fraction //, 
shown in y— axis, in dependence on an initial red fraction 
fi, shown in x~ axis; data are shown inside the unit square 
< fi,ff < 1- The values of Wf are defined as a relative 
number of realizations found inside each of 100 x 100 cells 
which cover the whole unit square. Here Nr = lO** realiza- 
tions of randomly distributed colors are used to obtained Wf 
values; for each realization the time evolution is followed up 
the convergence time with up to r = lO'^ steps. Left col- 
umn: Cambridge network; right column: Oxford network; 
here Ng = 3, 8, 13 from top to bottom. The probability Wf 
is proportional to color changing from zero (blue/black) to 
unity (red/gray). 

in the option 2 the society members are very independent 
and the influence of the ehte on their opinion is very 
weak. 



V. PROF MODELS ON THE LIVEJOURNAL 
NETWORK 

Even if one can expect that the properties of University 
networks are similar to those of the real social networks 
it is important to analyze the previous PROF models in 
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FIG. 11: (Color online) Same as in Fig. [TO] but for PROF- 
Sznajd model, option 2. 



the frame of real social network. For that wc use the 
LiveJournal network collected, described and presented 
at [17]. From this database we obtain the directed net- 
work with N = 3577166 nodes, Ni = 44913072 links 
which are mainly directed (only about 30% of links are 
symmetric). The Google matrix of the network is con- 
structed in a usual way [2^ and its PageRank vector is 
determined by the iteration process at the damping fac- 
tor a = 0.85. For the time evolution of fraction of red 
nodes / we use time iterations in t and t defined as in 
previous Sections. 

The PageRank probability decay P{K) is shown in 
Fig. [T31 It is well described by an algebraic law P{K) cx 
l/Rf^ with /3 = 0.448 ± 0.000046. The convergence of a 
fraction of red nodes f{t) takes place approximately on 
the same convergence time scale tc ^ 5 ^ 0{1) even if 
the size of the networks is increased almost by a factor 
20. 

In a way similar to the University networks we find that 
the homogeneous opinion of society elite presented in a 
small fraction of Ntop nodes influences a large fraction 




FIG. 12: (Color online) Time evolution of the fraction of 
red nodes /(r) in the PROF-Sznajd model with the ini- 
tial red nodes for the top PageRank nodes: Ntop = 200 
(blue/black); 1000 (green/light gray); 2000 (red/gray); here 
Ng = 8. Full/dashed curves are for Cambridge/Oxford net- 
works; left panel is for option 1; right panel is for option 2. 
Color of curves is red, green, blue from top to bottom at 
maximal r on both panels. 
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FIG. 13: (Color online) Data for LiveJournal network. Left 
panel: PageRank probability decay with PageRank index K 
(full curve), the fitted algebraic dependence is shown by the 
dashed line y = b ~ fix (for 1 < logj^g K < 5.5) with the 
exponent P = 0.448±0.000046 and b = -3.70±0.00023. Right 
panel: Time evolution of opinion given by a fraction of red 
nodes f{t) as a function of number of iterations t (cf Fig. [T|) 
at a = 0.5, few random initial realizations with fi — 0.5 are 
shown. 

of the whole society especially when the parameter a is 
not very large (see Fig. [TJ] in comparison with Fig. U]) . 
The influence of the elite at 1% of red nodes is larger in 
the case of LiveJournal network. It is possible that this 
is related to a 30% larger number of links but it is also 
possible that other structural network parameters also 
play a role here. 

In spite of certain similarities with the previous data 
for university networks discussed before we find that 
the opinion diagram for the LiveJournal network (see 
Fig. [14] right panel) is very different from those obtained 
for the University networks (see Fig. [3]): the bistability 
practically disappeared. We think that this difference 
originates from a significantly slower decay exponent for 
PageRank P{K) in the case of LiveJournal. To check 
this assumption we compare the probability distribution 
Wf of final opinion ff for an initial opinion fixed at 
fi = 0.4 using the PROF model with the usual linear 
weight P in Eq.([T]) and a quadratic weight proportional 
to (see Fig. [T5)l . For the linear weight we find that 
only very small values of // ~ 0.005 can be found for ini- 
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FIG. 14: (Color online) Data for LiveJournal network. Left 
panel: Dependence of the final fraction of red nodes // on the 
tenacious parameter a (or conformist parameter 6 = 1 — a) 
in the PROF model for initial red nodes in Ntop values of 
PageRank index {I < K < Ntop, cf Fig. g]). Here, Ntop = 
2000 blue; 10000 green, 35000 red curves (from bottom to top 
at a = 0.5); T — 0. Right panel: Same data as in Fig. Oat 
a = 0.5 with same parameters but for LiveJournal network. 

tial fi = 0.4 while for the quadratic weight we obtain a 
rather broad distribution of // values in the main range 
< // < 0.15 with a few large values // « 0.6. Thus we 
see that the final opinion is rather sensitive to the weight 
used in Eq.([T]). However, in contrast to the University 
networks (see Fig. ElFig. [5]), where we have narrow one 
peak or double peak distributions of // , for the LiveJour- 
nal network with quadratic weight we find a rather broad 
distribution of //. In the spirit of a renormalization map 
description considered in [l0| (see Figs. 1,2 there), it is 
possible to assume that one or two peaks corresponds to 
one or two fixed points attractor of the map. We make a 
conjecture that a broad distribution as in Fig. [13] (right 
panel) can correspond to a regime of strange chaotic at- 
tractor appearing in the renormalization map dynamics. 
In principle, such a chaotic renormalization dynamics is 
known to appear in coupled spins lattices when three- 
spin couplings are present (see [33j and Refs. therein). It 
is possible that a presence of weight probability associ- 
ated with the PageRank in a certain power may lead to 
a chaotic dynamics which would generate a broad distri- 
bution of final opinions //. 

We also made tests for the PROF-Sznajd model (op- 
tion 1) for the LiveJournal database. However, in this 
case at fi = 0.4, a = 0.5 we found only small ff values 
(similar of those in Fig. [TS] left panel) both for linear 
and quadratic weights in Eq.([T]). It is possible that the 
Sznajd groups are less sensitive to the probability weight. 



VI. PROF MODELS FOR THE TWITTER 
DATASET 

We also analyzed the opinion formation on the Twitter 
dataset with N = 41652230, Nt = 1468365182 taken 
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FIG. 15: (Color online) Data for LiveJournal network, prob- 
ability distribution Wf of final opinion // for a fixed initial 
opinion fi = 0.4 and a = 0.5 in the PROF model. Left 
panel: usual linear weight P{K) in Eq.([T]). Right panel: a 
quadratic weig ht P'^{K) in Eq.©. Histo grams are obtained 
with Nr — 500 initial random realizations, the normalization 
is fixed by the condition that the sum of Wf over all histogram 
bins is equal to unity. 



from jl8|. This size is rather large and due to that we 
present only main features of the PROF model for this 
directed network. 




FIG. 16: (Color online) Data for Twitter network. Left panel: 
PageRank probability decay with PageRank index K (full 
curve), the fitted algebraic dependence is shown by the dashed 
line y = b — Px (for 1 < logj^Q K < 5.5) with the exponent 
/3 = 0.511 ± 0.0021 and b = -3.33 ± 0.0069 (for the range 
5.5 < log^gK < 7 we find /3 — 1.23). Right panel: Depen- 
dence of the final fraction of red nodes // on the tenacious 
parameter a (or conformist parameter & = 1 — a) in the PROF 
model for initial red nodes in Ntop values of PageRank index 
{1<K< Ntop, cf. Fig.H Fig.[g. Here, Ntop = 1200 (blue 
line at // = 0); 1250 (red curve with circles), 1300 (top gree 
line); T = 0. 

The dependence of PageRank P on its index K is 
shown in Fig. [1^] (left panel). For the range 1 < logj^Q K < 
5.5 we find the decay exponent /3 « 0.51, being similar to 
those of LiveJournal network (see Fig. [T3)) even if there 
is a faster drop of P at larger K values. We note that the 
value /? ~ 0.5 is rather different from the value usually 
found for the Zipf law [si] and the WWW Q with ^ w 1. 
It is possible that this is related to a significantly larger 
average number of links per node which is increased by a 
factor 3.5 for the Twitter network compared to the Uni- 
versity networks analyzed in the previous Sections. 

The effect of the homogeneous elite opinion of all red 
Ntop nodes is shown in Fig. [TOl (right panel). We see that 



on the Twitter network a small fraction of elite with the 
fixed opinion {Ntop/N « 3 x 10~^) can impose it prac- 
tically to the whole community for all values of the con- 
formist parameter 1 — a. We find that for Ntop > 1300 all 
// values are very close to unity, while for Ntop < 1200 we 
find // = as it is seen in Fig. right panel. Thus, the 
transition is very sharp. We attribute such a strong in- 
fiuence of elite opinion to the very connected structure of 
Twitter network with a significantly larger average num- 
ber of links per node comparing to the University and 
LiveJournal networks. 

At a = 0.5, for a fixed fraction of initial opinion 
/,; = 0.4, we find that the probability distribution Wf 
of final opinion ff is located in the range of small values 
0.0006 <Wf< 0.0007 both for linear P and quadratic 
P^ weight used in Eq.(IT]) (we do not show these data). 
For the linear weight the situation is rather similar to the 
case of LiveJournal (see Fig. [T5)) . but for the quadratic 
weight we find a significant difference between two net- 
works (see Fig. [T5|) . The reason of such a significant 
difference for the quadratic weight case requires a more 
detailed comparison of network properties. 

The large size of Twitter network makes numerical sim- 
ulations of the PROF-Sznajd model rather heavy and due 
to that we did not study this model for this network. 



VII. DISCUSSION 

In this work we proposed the PageRank model of opin- 
ion formation of social networks and analyzed its prop- 
erties on example of four different networks. For two 
University networks we find rather similar properties of 
opinion formation. It is characterized by the important 
feature according to which the society elite with a fixed 
opinion can impose it to a significant fraction of the so- 
ciety members which is much larger than the initial elite 
fraction. However, when the initial opinions of society 
members, including the elite, are presented by two op- 
tions then we find a significant range of opinion fraction 
within a bistability regime. This range depends on the 
conformist parameter which characterizes the local as- 
pects of opinion formation of linked society members. 
The generalization of the Sznajd model for the scale-free 
social networks gives interesting examples of opinion for- 
mation where finite small size groups can keep their own 
opinion being different from the main opinion of the ma- 
jority. In this way the proposed PROF-Sznajd model 
shows that the totalitar opinions can be escaped by small 
sub-communities. We find that the properties of opinion 
formation are rather similar for the two University net- 
works of Cambridge and Oxford. However, the results 
obtained for networks of LiveJournal and Twitter show 
that the range of bistability practically disappears for 
these networks. Our data indicate that this is related to 
a slower algebraic decay of PageRank in these cases com- 
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pared to the University networks. However, the deep rea- 
sons of such a difference require a more detailed analysis. 
Indeed, LiveJournal and Twitter networks demonstrate 
rather different behavior for the P^-weighted function of 
opinion formation. The studies performed for regular 
networks [13] show existence of stable or bistable fixed 
points for opinion formation models that have certain 
similarities with the opinion formation properties found 
in our studies. At the same time the results obtained 
in [s^l show that three-body spin coupling can generate 
a chaotic renormalization dynamics. Some our results 
(Fig. [151 right panel) give indications on a possible exis- 
tence of such chaotic phase on the social networks. 

The enormous development of social networks in a few 
last years [2hi] definitely shows that the analysis of opin- 
ion formation on such networks requires further investi- 
gations. This research can find also various other appli- 
cations. One of them can be a neuronal network of brain 
which represents itself a directed scale- free network 35] . 
The applications of network science to brain networks is 
now under a rapid development (see e.g. [s^) and the 
Google matrix methods can find useful applications in 
this field ;3^. 

This work is supported in part by the EC FET Open 
project "New tools and algorithms for directed network 
analysis" (NADINE No 288956). We thank A.Bencziir 
and S.Vigna for providing us a friendly access to the Live- 
Journal database [l^l and the Twitter dataset [3]. 
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